Picture for Juntao Li

Juntao Li

When to Trust Tools? Adaptive Tool Trust Calibration For Tool-Integrated Math Reasoning

Add code
Apr 09, 2026
Viaarxiv icon

Flux Attention: Context-Aware Hybrid Attention for Efficient LLMs Inference

Add code
Apr 08, 2026
Viaarxiv icon

When Is Thinking Enough? Early Exit via Sufficiency Assessment for Efficient Reasoning

Add code
Apr 08, 2026
Viaarxiv icon

LongFlow: Efficient KV Cache Compression for Reasoning M

Add code
Mar 12, 2026
Viaarxiv icon

Where Matters More Than What: Decoding-aligned KV Cache Compression via Position-aware Pseudo Queries

Add code
Mar 12, 2026
Viaarxiv icon

MemoryRewardBench: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models

Add code
Jan 24, 2026
Viaarxiv icon

Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers

Add code
Jan 24, 2026
Viaarxiv icon

$\texttt{MemoryRewardBench}$: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models

Add code
Jan 17, 2026
Viaarxiv icon

Accelerate Speculative Decoding with Sparse Computation in Verification

Add code
Dec 26, 2025
Viaarxiv icon

Overview of CHIP 2025 Shared Task 2: Discharge Medication Recommendation for Metabolic Diseases Based on Chinese Electronic Health Records

Add code
Nov 09, 2025
Viaarxiv icon